Preserving Privacy in Data Mining using hybrid of Auto-Associative Neural Network and Particle Swarm Optimization: An application for bankruptcy prediction in banks
Abstract
Data mining has emerged as a significant technology for gaining knowledge from vast quantities of business, financial, networked and medical data. The goal of data mining approaches is to develop generalized knowledge rather than to identify specific information about specific individuals. There has been growing concern that the use of this technology violates individual privacy, which opens new challenges in the area of Privacy Preserving Data Mining (PPDM). Privacy regulations and other privacy concerns may prevent data owners from sharing information for data analysis. To solve this problem, data owners must design a strategy that meets privacy requirements and still guarantees valid data mining results. This paper discusses the possibility of using a neural network and the Particle Swarm Optimization (PSO) algorithm for preserving privacy in data mining. The approach is evaluated on five benchmark data sets and four bankruptcy data sets. The paper presents methods whereby the privacy and secrecy of a bank's sensitive data are protected on the one hand, and the resulting dataset can be mined without a considerable loss in model accuracy on the other. A multilayer perceptron, the J48 decision tree and logistic regression are used as classifiers for illustration. The paper reports experimental results showing how bankruptcy prediction can be carried out for various bank datasets while protecting the banks' sensitive information. The results have been compared with the other methods used for Preserving Privacy in Data Mining namely random
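The abstract does not give the network architecture, the PSO settings or the exact way the two are combined, so the following is only a minimal sketch under stated assumptions: a small auto-associative network (tanh hidden layer, linear output of the same size as the input) whose weights are fitted by a plain global-best PSO that minimizes reconstruction error, with the network's outputs then taken as the transformed records to be released. The function names, layer size, swarm size and coefficients are all hypothetical choices made for illustration.

```python
import math
import random


def forward(weights, x, n_hidden):
    """Auto-associative pass: input -> tanh hidden layer -> linear output of the same size."""
    d = len(x)
    w1 = weights[: d * n_hidden]   # input-to-hidden weights (no biases, for brevity)
    w2 = weights[d * n_hidden:]    # hidden-to-output weights
    hidden = [math.tanh(sum(w1[j * d + i] * x[i] for i in range(d)))
              for j in range(n_hidden)]
    return [sum(w2[k * n_hidden + j] * hidden[j] for j in range(n_hidden))
            for k in range(d)]


def reconstruction_error(weights, data, n_hidden):
    """Mean squared reconstruction error over all records."""
    total = 0.0
    for x in data:
        y = forward(weights, x, n_hidden)
        total += sum((a - b) ** 2 for a, b in zip(x, y))
    return total / len(data)


def pso_train(data, n_hidden=2, n_particles=15, n_iters=40, seed=0):
    """Fit the network weights with a basic global-best PSO (assumed settings)."""
    rng = random.Random(seed)
    d = len(data[0])
    dim = 2 * d * n_hidden
    inertia, c1, c2 = 0.7, 1.5, 1.5  # assumed coefficients, not taken from the paper
    pos = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_err = [reconstruction_error(p, data, n_hidden) for p in pos]
    g = min(range(n_particles), key=pbest_err.__getitem__)
    gbest, gbest_err = pbest[g][:], pbest_err[g]
    history = [gbest_err]  # best error found so far, recorded once per iteration
    for _ in range(n_iters):
        for p in range(n_particles):
            for k in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[p][k] = (inertia * vel[p][k]
                             + c1 * r1 * (pbest[p][k] - pos[p][k])
                             + c2 * r2 * (gbest[k] - pos[p][k]))
                pos[p][k] += vel[p][k]
            err = reconstruction_error(pos[p], data, n_hidden)
            if err < pbest_err[p]:
                pbest[p], pbest_err[p] = pos[p][:], err
                if err < gbest_err:
                    gbest, gbest_err = pos[p][:], err
        history.append(gbest_err)
    return gbest, history


# Toy records standing in for sensitive attributes scaled to [0, 1].
data_rng = random.Random(1)
records = [[data_rng.random() for _ in range(3)] for _ in range(20)]
weights, history = pso_train(records)
# Assumption: the reconstructed records are what would be shared for mining.
transformed = [forward(weights, x, 2) for x in records]
```

In this sketch the hidden layer is narrower than the input, so the released records are approximations rather than copies of the originals. Whether that gives adequate privacy, and how much classifier accuracy it costs, would have to be measured on real data as the paper does.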
Similar resources
S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization
Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...
Traffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization
Prediction of traffic is very crucial for its management. Because of human involvement in the generation of this phenomenon, traffic signal is normally accompanied by noise and high levels of non-stationarity. Therefore, traffic signal prediction as one of the important subjects of study has attracted researchers’ interests. In this study, a combinatorial approach is proposed for traffic signal...
Modeling heat transfer of non-Newtonian nanofluids using hybrid ANN-Metaheuristic optimization algorithm
An optimal artificial neural network (ANN) has been developed to predict the Nusselt number of non-Newtonian nanofluids. The resulting ANN is a multi-layer perceptron with two hidden layers consisting of six and nine neurons, respectively. The tangent sigmoid transfer function is the best for both hidden layers and the linear transfer function is the best transfer function for the output layer....
PREDICTION OF EARTHQUAKE INDUCED DISPLACEMENTS OF SLOPES USING HYBRID SUPPORT VECTOR REGRESSION WITH PARTICLE SWARM OPTIMIZATION
Displacements induced by earthquake can be very large and result in severe damage to earth and earth supported structures including embankment dams, road embankments, excavations and retaining walls. It is important, therefore, to be able to predict such displacements. In this paper, a new approach to prediction of earthquake induced displacements of slopes (EIDS) using hybrid support vector re...
Application of Multi Objective HFAPSO algorithm for Simultaneous Placement of DG, Capacitor and Protective Device in Radial Distribution Network
In this paper, simultaneous placement of distributed generation, capacitor bank and protective devices are utilized to improve the efficiency of the distribution network. The objectives of the problem are reduction of active and reactive power losses, improvement of voltage profile and reliability indices and increasing distribution companies’ profit. The combination of firefly algorithm, parti...
Publication date: 2010